-----------------------------------
Software & operating system
-----------------------------------

Software: STATA 15.1 MP

Operating system: Microsoft Windows 10

-----------------------------------
File list
-----------------------------------

0_DataClean.do               STATA file for data cleaning
1_FigTab.do                  STATA do-file for generating figures and tables
ASP_Unit_2005_2014_v2.dta    ASP data set*
USC_ATC4.dta                 Drug class data set*
WAC_Unit_1994_2014.dta       WAC data set* 
chemodata20140825-nme.csv    Anti-cancer drug data set from Horward et al (2015)
fda_applno.dta               FDA drug characteristics data set
fdandc_pack.dta              FDA application number data set
mms_rev.dta                  Medicare market share data set*
ndc_hcpcs_crosswalk.dta      NDC/HCPCS crosswalk 
partb_usc.dta                Part B class data set
ppi_pharma.dta               PPI data set

* Proprietary data sets not permitted to be shared 

-----------------------------------
Data dictionary
-----------------------------------

anda: dummy for Abbreviated New Drug Application (source: Food and Drug Administration)
approvaldate: approval date of the drug (source: Horward et al 2015)
billingunitformerlydrugformcode: drug unit (source: AnalySource)
brandname:  brandname of the drug (source: AnalySource)
chml_newmol: dummy for FDA new molecular entity (source: Food and Drug Administration) 
disease: primary indication (source: Horward et al. 2015)
dosageform: dosage form of the drug (source: AnalySource)
genericname: generic name of the drug (source: AnalySource)
genericnameindicator: specifies whether a product is a brand-named product or a generically named product (source: AnalySource)
innovatorindicator: identifies products that have a NDA, a Biologic License Application, or are the earliest product registered within a Clinical Formulation ID (source: AnalySource)
lyg: life years gained (source: Horward et al. 2015)
mdate: monthly date (source: AnalySource)
mms: Medicare market share (source: Centers for Medicare & Medicaid Services and IQVIA)
monthlycostreal: the real monthly cost of the drug (source: Horward et al. 2015)
nda: dummy for New Drug Application (source: Food and Drug Administration)
ndc: National Drug Code (source: AnalySource)
partb_cms: dummy for drugs reimbursed under Part B (source: Centers for Medicare & Medicaid Services)
ppi_pharma: producer price index: pharmaceutical and medicine manufacturing (source: Economic Research at the St. Louis Fed)
priority: dummy for drugs with FDA priority review (source: Food and Drug Administration)
retl70: dummy for drugs with > 70% retail sales (source: IQVIA) 
retl80: dummy for drugs with > 80% retail sales (source: IQVIA) 
strength: drug strength (source: AnalySource)
uscclassification: Uniform System of Classification (source: AnalySource)
wac_unit: wholesale acquisition cost per unit (source: AnalySource)

----------------------------------
Sources of proprietary data
-----------------------------------

* Drug list price data, including Wholesale Acquisition Cost (WAC), are available from a variety of sources. One leading data source is First Databank which we accessed through AnalySource (https://www.analysource.com/).

* One of the control groups was retail drugs. The supplementary appendix provides a list of drug classes which are primarily sold in a retail setting. One method of creating a retail classification is to use IQVIA data such as MIDAS (US only) or National Sales Perspectives (NSP) (https://www.iqvia.com). 

